47 research outputs found

HMM-FRAME: accurate protein domain classification for metagenomic sequences containing frameshift errors

Author: A Kislyuk
AL Delcher
C Quince
C Wang
DT Gibson
E Birney
E Halperin
H Noguchi
I Antonov
K Karplus
M Borodovsky
M Girdea
M Larkin
M Pellegrini
M Peltola
M Rho
N Brown
N Eriksson
R Durbin
R Edwards
S Altschul
S Iwai
T Schiex
W Li
WI Chang
X Guan
Yanni Sun
Yuan Zhang
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Protein domain classification is an important step in metagenomic annotation. The state-of-the-art method for protein domain classification is profile HMM-based alignment. However, the relatively high rates of insertions and deletions in homopolymer regions of pyrosequencing reads create frameshifts, causing conventional profile HMM alignment tools to generate alignments with marginal scores. This makes error-containing gene fragments unclassifiable with conventional tools. Thus, there is a need for an accurate domain classification tool that can detect and correct sequencing errors. Results: We introduce HMM-FRAME, a protein domain classification tool based on an augmented Viterbi algorithm that can incorporate error models from different sequencing platforms. HMM-FRAME corrects sequencing errors and classifies putative gene fragments into domain families. It achieved high error detection sensitivity and specificity in a data set with annotated errors. We applied HMM-FRAME in Targeted Metagenomics and a published metagenomic data set. The results showed that our tool can correct frameshifts in error-containing sequences, generate much longer alignments with significantly smaller E-values, and classify more sequences into their native families. Conclusions: HMM-FRAME provides a complementary protein domain classification tool to conventional profile HMM-based methods for data sets containing frameshifts. Its current implementation is best used for small-scale metagenomic data sets. The source code of HMM-FRAME can be downloaded at http://www.cse.msu.edu/~zhangy72/hmmframe/ and at https://sourceforge.net/projects/hmm-frame/.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Seeds for effective oligonucleotide design

Author: A Califano
Anahita Mansouri Bigvand
B Ma
F Li
H Nielsen
J Rouillard
L Ilie
L Kaderali
L Noe
Lucian Ilie
M David
M Girdea
M Li
N Reymond
S Altschul
S Feng
S Rahman
S Rimour
Shima Khoshraftar
Silvana Ilie
T Smith
WH Chung
Y Chen
Z Bozdech
Z He
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: DNA oligonucleotides are a very useful tool in biology. The best algorithms for designing good DNA oligonucleotides filter out unsuitable regions using a seeding approach. Determining the quality of the seeds is crucial for the performance of these algorithms. Results: We present a sound framework for evaluating the quality of seeds for oligonucleotide design. The F-score is used to measure the accuracy of each seed. A number of natural candidates are tested: contiguous (BLAST-like), spaced, transitions-constrained, and multiple spaced seeds. Multiple spaced seeds are the best, with more seeds providing better accuracy. Single spaced and transition seeds are very close whereas, as expected, contiguous seeds come last. Increased accuracy comes at the price of reduced efficiency. An exception is that single spaced and transitions-constrained seeds are both more accurate and more efficient than contiguous ones. Conclusions: Our work confirms another application where multiple spaced seeds perform the best. It will be useful in improving the algorithms for oligonucleotide design.

CiteSeerX

Scholarship@Western

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SoftPanel: a website for grouping diseases and related disorders for generation of customized panels

Author: A Bravo
A Hamosh
A Liberzon
A Rath
A Subramanian
Cong Zhang
D Croft
D Smedley
D Szklarczyk
GU Ganegoda
J Gillis
Johnathan Watkins
K Lage
KI Goh
LC Tranchevent
Likun Wang
M Ashburner
M Girdea
M Kanehisa
M Oti
MA Driel
Michael McNutt
MJ Bamshad
MJ Cowley
MN Nikiforova
N Gill
RC Deo
S Razick
T Sing
X Wu
X Yao
Y Chen
Y Moreau
Yan Jin
Yuxin Yin
Publication venue: Springer Science and Business Media LLC

Crossref

Text-based phenotypic profiles incorporating biochemical phenotypes of inborn errors of metabolism improve phenomics-based diagnosis

Author: A Sifrim
AA Philippakis
AR Deans
BK Burton
CJ Mungall
Clara D. M. van Karnebeek
D Greene
D Houle
D Smedley
HG Brunner
J Amberger
JA Blake
Jake Lever
Jessica J. Y. Lee
JS Amberger
JV Leonard
JX Chong
KM Boycott
LG Biesecker
M Girdea
Michael M. Gottlieb
MM Gottlieb
MP Fay
Nenad Blau
PN Robinson
RCM Hennekam
S Köhler
Steven J. M. Jones
WP Bone
Wyeth W. Wasserman
Z Gu
Publication venue: Springer Science and Business Media LLC

Crossref

A visual and curatorial approach to clinical variant prioritization and disease gene discovery in genome-wide diagnostics

Background: Genome-wide data are increasingly important in the clinical evaluation of human disease. However, the large number of variants observed in individual patients challenges the efficiency and accuracy of diagnostic review. Recent work has shown that systematic integration of clinical phenotype data with genotype information can improve diagnostic workflows and prioritization of filtered rare variants. We have developed visually interactive, analytically transparent analysis software that leverages existing disease catalogs, such as the Online Mendelian Inheritance in Man database (OMIM) and the Human Phenotype Ontology (HPO), to integrate patient phenotype and variant data into ranked diagnostic alternatives. Methods: Our tool, “OMIM Explorer” (http://www.omimexplorer.com), extends the biomedical application of semantic similarity methods beyond those reported in previous studies. The tool also provides a simple interface for translating free-text clinical notes into HPO terms, enabling clinical providers and geneticists to contribute phenotypes to the diagnostic process. The visual approach uses semantic similarity with multidimensional scaling to collapse high-dimensional phenotype and genotype data from an individual into a graphical format that contextualizes the patient within a low-dimensional disease map. The map proposes a differential diagnosis and algorithmically suggests potential alternatives for phenotype queries, in essence generating a computationally assisted differential diagnosis informed by the individual’s personal genome. Visual interactivity allows the user to filter and update variant rankings by interacting with intermediate results. The tool also implements an adaptive approach for disease gene discovery based on patient phenotypes. Results: We retrospectively analyzed pilot cohort data from the Baylor Miraca Genetics Laboratory, demonstrating performance of the tool and workflow in the re-analysis of clinical exomes. Our tool assigned to clinically reported variants a median rank of 2, placing causal variants in the top 1% of filtered candidates across the 47 cohort cases with reported molecular diagnoses of exome variants in OMIM Morbidmap genes. Our tool outperformed Phen-Gen, eXtasy, PhenIX, PHIVE, and hiPHIVE in the prioritization of these clinically reported variants. Conclusions: Our integrative paradigm can improve efficiency and, potentially, the quality of genomic medicine by more effectively utilizing available phenotype information, catalog data, and genomic knowledge.

Crossref

Springer - Publisher Connector

PubMed Central

DSpace at Rice University

FigShare

The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data

Author: Bailleul-Forestier Isabelle
Bauer Sebastian
Black Graeme C M
Brown Danielle L
Brudno Michael
Campbell Jennifer
de Vries Bert B A
Doelken Sandra C
Eppig Janan T
Firth Helen V
Fitzpatrick David R
Freson Kathleen
Girdea Marta
Gkoutos Georgios V
Haendel Melissa
Helbig Ingo
Hurst Jane A
Jackson Andrew P
Jackson Laird G
Jähn Johanna
Kelly Anne M
Köhler Sebastian
Ledbetter David H
Leeuw Nicole de
Lewis Suzanna E
Mansour Sahar
Martin Christa L
Moss Celia
Mumford Andrew
Mungall Christopher J
Ouwehand Willem H
Park Soo-Mi
Riggs Erin Rooney
Robinson Peter N
Ruef Barbara J
Schofield Paul
Scott Richard H
Sisodiya Sanjay
Smedley Damian
Smith Cynthia L
Vooren Steven Van
Vulto-van Silfhout Anneke T
Wapner Ronald J
Washington Nicole L
Westerfield Monte
Wilkie Andrew O M
Wright Caroline F
Publication date: 01/01/2013

The Human Phenotype Ontology (HPO) project, available at http://www.human-phenotype-ontology.org, provides a structured, comprehensive and well-defined set of 10,088 classes (terms) describing human phenotypic abnormalities and 13,326 subclass relations between the HPO classes. In addition we have developed logical definitions for 46% of all HPO classes using terms from ontologies for anatomy, cell types, function, embryology, pathology and other domains. This allows interoperability with several resources, especially those containing phenotype information on model organisms such as mouse and zebrafish. Here we describe the updated HPO database, which provides annotations of 7,278 human hereditary syndromes listed in OMIM, Orphanet and DECIPHER to classes of the HPO. Various meta-attributes such as frequency, references and negations are associated with each annotation. Several large-scale projects worldwide utilize the HPO for describing phenotype information in their datasets. We have therefore generated equivalence mappings to other phenotype vocabularies such as LDDB, Orphanet, MedDRA, UMLS and phenoDB, allowing integration of existing datasets and interoperability with multiple biomedical resources. We have created various ways to access the HPO database content using flat files, a MySQL database, and Web-based tools. All data and documentation on the HPO project can be found online

Aberystwyth Research Portal

The Jackson Laboratory: The Mouseion at the JAXlibrary

University of Birmingham Research Portal

Edinburgh Research Explorer

The University of Manchester - Institutional Repository

PubMed Central

Oxford University Research Archive

Open Research Exeter

Radboud Repository

St George's Online Research Archive

Explore Bristol Research

RD-Connect, NeurOmics and EURenOmics: collaborative European initiative for rare diseases

Functional Genomics of Muscle, Nerve and Brain Disorder

Crossref

ZENODO

Leiden University Scholarly Publications

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

UPF Digital Repository

Meeting the challenges of implementing rapid genomic testing in acute pediatric care

Author: Alison Yeung
Anna Jarmolowicz
Belinda Chong
CC van Diemen
CL Gaff
Clara L Gaff
DA Chambers
Dean G Phelan
Gemma R Brett
GR Monroe
H Daoud
JE Petrikin
Jessica R Riseley
Justine E Marum
Justine Elliott
KD Farwell
LA Frankel
LE Vissers
LJ Damschroder
LK Willig
M Girdea
M Walsh
Matthew Hunter
Matthew Regan
MC Roberts
Melissa Martyn
Miriam Fanjul-Fernandez
NA Miller
Natalie B Tan
Rachel Stapleton
S Richards
SE Soden
Sebastian Lunke
SF Kingsmore
Smitha Kumble
SP Sadedin
Stephanie Best
Susan M White
TA Manolio
Tiong Y Tan
TY Tan
WM Cohen
Yael Prawer
Z Stark
Zornitza Stark
Publication venue: Springer Science and Business Media LLC
Publication date: 15/03/2018

Authors: Stark, Z.; Lunke, S.; Brett, G.; Tan, N.; Stapleton, R.; Kumble, S.; Yeung, A.; Phelan, D.; Chong, B.; Fernandez, M.F.; Marum, J.E.; Hunter, M.; Jarmolowicz, A.; Prawer, Y.; Riseley, J.R.; Regan, M.; Elliott, J.; Martyn, M.; Best, S.; Tan, T.; Gaff, C.L.; and White, S.M.

Crossref

Cronfa at Swansea University